Abstract

A family of sets $\mathcal{F}$ (and the corresponding family of 0-1 vectors) is called $t$-cancellative if for all distinct $t+2$ members $A_1, \dots, A_t$ and $B, C \in \mathcal{F}$

$$A_1 \cup \dots \cup A_t \cup B \ne A_1 \cup \dots \cup A_t \cup C.$$

Let $c(n,t)$ be the size of the largest $t$-cancellative family on $n$ elements, and let $c_k(n,t)$ denote the size of the largest $k$-uniform family. We significantly improve the previous upper bounds, e.g., we show $c(n,2) \le 2^{0.322n}$ (for $n > n_0$). Using an algebraic construction we show that $c_{2k}(n,2) = \Theta(n^k)$ for each $k$ when $n \to \infty$.



1 Introduction, definitions



There are many instances in Coding Theory when codewords must be restored from partial information, like defected data (error correcting codes), or from some superposition of the strings (these can lead to Sidon sets, sum-free sets, etc.). A family of sets $\mathcal{F}$ (and the corresponding family of 0-1 vectors) is called cancellative if $A$ and $A \cup B$ determine $B$ (in case of $A, B \in \mathcal{F}$ and $A \ne A \cup B$). For a precise definition we require that for all $A, B, C \in \mathcal{F}$, $A \ne B$, $A \ne C$

$$A \cup B = A \cup C \implies B = C.$$

Let $c(n)$ be the size of the largest cancellative family on $n$ elements, and let $c_k(n)$ denote the size of the largest $k$-uniform family. This definition can be extended (see above in the abstract). In this paper we focus on 2-cancellative $r$-uniform hypergraphs, i.e., families of $r$-sets, and on 2-cancellative codes, where there is no restriction on the sizes of the hyperedges.
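The definition can be checked mechanically on small examples. The following is a minimal brute-force sketch, not part of the paper; the function name `is_t_cancellative` and the two test families (a star and a four-cycle, both 2-uniform) are illustrative assumptions.

```python
from itertools import combinations

def is_t_cancellative(family, t):
    """Return True if no t distinct members A_1..A_t and two further distinct
    members B, C satisfy A_1 u ... u A_t u B == A_1 u ... u A_t u C."""
    sets = [frozenset(s) for s in family]
    if len(set(sets)) != len(sets):
        raise ValueError("family members must be distinct")
    for idx in combinations(range(len(sets)), t):
        base = frozenset().union(*(sets[i] for i in idx))
        rest = [sets[i] for i in range(len(sets)) if i not in idx]
        for b, c in combinations(rest, 2):
            if base | b == base | c:
                return False
    return True

# Hypothetical test families on small ground sets.
star = [{0, i} for i in range(1, 5)]            # four edges through vertex 0
four_cycle = [{1, 2}, {2, 3}, {3, 4}, {4, 1}]   # edges of a 4-cycle

print(is_t_cancellative(star, 2))        # the star passes the check
print(is_t_cancellative(four_cycle, 2))  # {1,2} u {3,4} already covers everything
```

The search is exponential in $t$ and is only meant for sanity checks on tiny families.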

Speaking about a hypergraph $\mathbb{F} = (V, \mathcal{F})$ we frequently identify the vertex set $V = V(\mathbb{F})$ with the set of first integers $[n] := \{1, 2, \dots, n\}$, or with the elements of a $q$-element finite field $\mathbf{F}_q$. To shorten notation we frequently say 'hypergraph $\mathcal{F}$' (or set system $\mathcal{F}$), thus identifying $\mathbb{F}$ with its edge set $\mathcal{F}$. The degree, $\deg_{\mathcal{F}}(x)$, of an element $x \in [n]$ is the number of hyperedges of $\mathcal{F}$ containing $x$. The hypergraph is uniform if every edge has the same number of elements; $r$-uniform means $|F| = r$ for all $F \in \mathcal{F}$. An $r$-uniform hypergraph $(V, \mathcal{F})$ is called $r$-partite if there exists an $r$-partition of $V$, $V = V_1 \cup \dots \cup V_r$, such that $|F \cap V_i| = 1$ for all $F \in \mathcal{F}$, $i \in [r]$.

Let $f(n, P_1, P_2, \dots)$ denote the maximum number of subsets which can be selected from $\{1, \dots, n\}$ satisfying all the properties $P_1, P_2, \dots$. With this notation $c(n,t) := f(n, t\text{-CANC})$, where $t$-CANC stands for $t$-cancellativeness.

A hypergraph is linear if $|E \cap F| \le 1$ holds for every pair of edges. An $(n, r, 2)$-packing is a linear $r$-uniform hypergraph $\mathcal{P}$ on $n$ vertices. Obviously, $|\mathcal{P}| \le \binom{n}{2} / \binom{r}{2}$. If equality holds here, then $\mathcal{P}$ is called an $S(n, r, 2)$ Steiner system.



2 Cancellative and locally thin families 

The asymptotics of the maximum size of a cancellative family was given by Tolhuizen [44] (construction) and in [20] (upper bound): there exists a $C > 0$ such that

$$\frac{C}{\sqrt{n}}\,1.5^n < c(n) < 1.5^n$$

holds. The problem was proposed by Erdős and Katona [27], who conjectured $c(n) = O(3^{n/3})$. This was disproved by an elegant construction by Shearer showing $c(3k) \ge k3^{k-2}$, leading to $c(n) > 1.46^n$ for $n > n_0$. Since a kind of product of two cancellative families is again cancellative we have $c(n+m) \ge c(n)c(m)$. Thus the limit $\lim c(n)^{1/n}$ obviously exists. This is not known for 2-cancellative hypergraphs, so Körner and Sinaimeri [33] introduced

$$t(4) := \limsup_{n \to \infty} \frac{1}{n} \log_2 c(n,2)$$

and proved $0.11 < t(4) < 0.42$. As usual all logarithms have base two. The lower bound follows from a standard probabilistic argument. We will show that $t(4) \le \log_2 5 - 2 = 0.3219\ldots$

Theorem 2.1 $c(n,2) < 9\sqrt{n}\,(5/4)^n$.

The proof is postponed to Section 6. Without loss of generality we can suppose that the underlying $n$-set is $[n]$. We associate to every subset $A \in \mathcal{F}$ its characteristic binary vector, $\mathbf{x} := \mathbf{x}(A) = (x_1, \dots, x_n)$, with $x_i = 1$ if $i \in A$ and $x_i = 0$ otherwise. One can immediately see that requiring the family to be $t$-cancellative is equivalent to requiring its representing set of binary vectors to satisfy the following: for every $(t+2)$-tuple $(\mathbf{x}^{(1)}, \mathbf{x}^{(2)}, \dots, \mathbf{x}^{(t+2)})$ of distinct vectors in the set (considered in an arbitrary but fixed order) there exist at least $t+1$ different values of $k \in [n]$ such that the corresponding ordered $(t+2)$-tuples $(x_k^{(1)}, x_k^{(2)}, \dots, x_k^{(t+2)})$ are all different while for each of them we have the sum $x_k^{(1)} + x_k^{(2)} + \dots + x_k^{(t+2)} = 1$. In hypergraph language, at least $t+1$ of the sets among the $t+2$ have degree one vertices. This problem can be seen in a more general context. We can require that for every ordered $a$-tuple of vectors in the set there exist at least $b$ different columns which sum up to 1 (obviously $1 \le b \le a$).

$$t(a,b) := \limsup_{n \to \infty} \frac{1}{n} \log_2 f(n, \text{locally } (a,b)\text{-thin}).$$

We have $t(a,1) \ge t(a,2) \ge \dots \ge t(a,a)$.

This problem was investigated by Alon, Fachini, Körner, and Monti; they proved that $t(4,1) < 0.4561\ldots$ and $t(a,1) \le 2/a$ for all even $a$ and

n{-) < t{a,l) < O(i^) < 0.793 
a a 

for all $a$. This is a notoriously hard problem. In particular, one does not even know whether $t(3,1) < 1$ (see Erdős and Szemerédi [17]).

Concerning one of the most interesting cases, the case $a = 4$, a locally $(4,1)$-thin family is also weakly union-free ($A \cup B = C \cup D$ implies $\{A, B\} = \{C, D\}$). The best upper bound

$$\log_2 f(n, \text{weakly union-free}) \le (0.4998\ldots + o(1))\,n$$

is due to Coppersmith and Shearer [9]. Nothing nontrivial is known about $t(4,2)$. Our Theorem 2.1 implies $t(4,3) \le \log_2 5 - 2 = 0.3219\ldots$ One can find more similar problems in the survey article by Körner [31] and in the more recent paper by Körner and Monti.



3 Cancellative and cover-free families 

A family $\mathcal{F} \subset 2^{[n]}$ is $g$-cover-free if it is locally $(g+1, g+1)$-thin. In other words, for arbitrary distinct members $A_0, A_1, \dots, A_g \in \mathcal{F}$

$$A_0 \not\subset \bigcup_{i=1}^{g} A_i.$$

Let $C(n,g)$ ($C_r(n,g)$) be the maximum size of a $g$-cover-free $n$ vertex code ($r$-uniform hypergraph, resp.). Obviously, $C(n,g) \le C(n,g-1) \le \dots \le C(n,1)$ and $C_r(n,g) \le C_r(n,g-1) \le \dots \le C_r(n,2)$.

Union free and cover free families were introduced by Kautz and Singleton [28]. They studied binary codes with the property that the disjunctions (bitwise ORs) of distinct at most $g$-tuples of codewords are all different. In information theory these codes are usually called superimposed and they have been investigated in several papers on multiple access communication (see, e.g., Nguyen Quang A and Zeisel [1], D'yachkov and Rykov [11], Johnson [26]). The same problem has been posed, in different terms, by Erdős, Frankl and Füredi [13, 14] in combinatorics, by Sós [43] in combinatorial number theory, and by Hwang and Sós [25] in group testing. For recent generalizations see, e.g., Alon and Asodi [2], and De Bonis and Vaccaro [10]. D'yachkov and Rykov [11] proved that there are positive constants $\alpha_1$ and $\alpha_2$ such that

$$\alpha_1 \frac{1}{g^2} < \frac{\log C(n,g)}{n} < \alpha_2 \frac{\log g}{g^2} \qquad (1)$$

holds for every $g$ and $n > n_0(g)$. One can find short proofs of this upper bound in [23] and in Ruszinkó [35].

Using induction on $t$ we extend Theorem 2.1 for all $t \ge 2$.

Theorem 3.1 There exists an absolute constant $\alpha > 0$ such that

$$c(n,t) < \alpha\, n^{(t-1)/2} \left(\frac{t+3}{t+2}\right)^n$$

holds for all $n, t \ge 2$.

The proof is postponed to Section 7. This might give a decent upper bound for small $t$ but the true order of magnitude of $c(n,t)$ is much smaller.

Theorem 3.2 There exist positive constants $\beta_1$ and $\beta_2$ and a bound $n_0(t)$ depending only on $t$ such that the following bounds hold for all $n > n_0(t)$, $t \ge 2$

$$\frac{\beta_1}{t^2} < \frac{\log c(n,t)}{n} < \beta_2 \frac{\log t}{t^2}. \qquad (2)$$

Proof: The lower bound is a standard random choice and it follows from (1) since $c(n,t) \ge C(n,t+1)$. For the upper bound we observe that

$$c(n,t) \le 1 + \lfloor t/2 \rfloor + C(n, \lfloor t/2 \rfloor). \qquad (3)$$

Indeed, if $|\mathcal{F}|$, $\mathcal{F} \subset 2^{[n]}$, exceeds the right hand side, then one can find $h+1$ distinct members $A_0, A_1, \dots, A_h \in \mathcal{F}$, where $h = \lfloor t/2 \rfloor$, such that $A_0 \subset A_1 \cup \dots \cup A_h$. Then the size of the family $\mathcal{F}' := \mathcal{F} \setminus \{A_0, A_1, \dots, A_h\}$ still exceeds $C(n,h)$, so there is another set of distinct members $B_0, \dots, B_h \in \mathcal{F}'$ with $B_0 \subset B_1 \cup \dots \cup B_h$. Taking another set $D \in \mathcal{F}'$ if $t$ is odd, we have selected $t+2$ distinct members of $\mathcal{F}$ such that the union of $t$ of them, namely $A_1, \dots, A_h$ and $B_1, \dots, B_h$ and eventually $D$, covers the other two, namely $A_0$ and $B_0$. Hence $\mathcal{F}$ cannot be $t$-cancellative.

Finally, the upper bound (2) is implied by (3) and (1). □

4 Three-uniform cancellative families and sparse hypergraphs



The rest of our results concern $r$-uniform cancellative families. We are especially interested in the case when $n$ is large with respect to $r$.

Frankl and the present author [22] determined asymptotically the maximum size of an $r$-uniform $g$-cover-free family, showing that there exists a positive constant $\gamma := \gamma(r,g)$ such that

$$C_r(n,g) = (\gamma + o(1))\,n^{\lceil r/g \rceil} \qquad (4)$$

as $r$ and $g$ are fixed and $n$ tends to infinity. A way to determine $\gamma(r,g)$ was also described. This and the $r$-uniform version of (3),

$$C_r(n,t+1) \le c_r(n,t) \le 1 + \lfloor t/2 \rfloor + C_r(n, \lfloor t/2 \rfloor),$$

imply

$$(\gamma(r,t+1) - o(1))\,n^{\delta_1} \le c_r(n,t) \le (\gamma(r, \lfloor t/2 \rfloor) + o(1))\,n^{\delta_2}, \qquad (5)$$

where the exponents are $\delta_1 := \lceil r/(t+1) \rceil$ and $\delta_2 := \lceil r/\lfloor t/2 \rfloor \rceil$. The next theorem shows that obtaining the true asymptotic for $c_r(n,t)$, like the one in (4) for $C_r(n,g)$, is probably a very difficult problem even in the case $r = 3$.

Brown, Erdős and Sós [12, 8, 7] introduced the function $f_r(n,v,e)$ to denote the maximum number of edges in an $r$-uniform hypergraph on $n$ vertices which does not contain $e$ edges spanned by $v$ vertices. Such hypergraphs are called $G(v,e)$-sparse (more precisely $G_r(v,e)$-sparse). They showed that $f_r(n, e(r-k)+k, e) = \Theta(n^k)$ for every $2 \le k < r$ and $e \ge 2$. The upper bound $(e-1)\binom{n}{k}$ is easy: no $k$-set can be contained in $e$ hyperedges. If we forbid $e$ edges spanned by one more vertex then the problem becomes much more difficult. Brown, Erdős and Sós conjectured that

$$f_r(n, e(r-k)+k+1, e) = o(n^k).$$

The (6,3)-Theorem of Ruzsa and Szemerédi [36] deals with the case $(e,k,r) = (3,2,3)$, when no six points contain three triples. They showed that there exists an $\alpha > 0$ such that

$$n^2 e^{-\alpha\sqrt{\log n}} = n^{2-o(1)} < f_3(n,6,3) = o(n^2). \qquad (6)$$

Since a $G(6,3)$-sparse system is $G(7,4)$-sparse we have

$$f_3(n,6,3) \le f_3(n,7,4) \qquad (7)$$

and Erdős conjectures that $f_3(n,7,4) = o(n^2)$.



Theorem 4.1

$$f_3(n,7,4) - \frac{2}{5}n \le c_3(n,2) \le \frac{9}{2} f_3(n,7,4) + n. \qquad (8)$$

The proof is presented in Section 8.

The (6,3) theorem was extended by Erdős, Frankl, and Rödl [15] for arbitrary fixed $r \ge 3$,

$$n^{2-o(1)} < f_r(n, 3(r-2)+3, 3) = o(n^2), \qquad (9)$$

and then by Alon and Shapira [5]: $n^{k-o(1)} < f_r(n, 3(r-k)+k+1, 3) = o(n^k)$. Even the case $k = 2$, $f_r(n, e(r-2)+3, e) = o(n^2)$, is still open. Nearly tight upper bounds were established by Sárközy and Selkow [37]: $f_r(n, e(r-k)+k+\lfloor \log_2 e \rfloor, e) = o(n^k)$ for $r > k \ge 2$ and $e \ge 3$, and $f_r(n, 4(r-k)+k+1, 4) = o(n^k)$ for the case $e = 4$, $r > k \ge 3$.



5 An upper bound for uniform families 

Theorem 5.1 For every $k$ and $n$ we have

$$c_{2k}(n,2) \le \frac{\binom{n}{k}}{\frac{1}{2}\binom{2k}{k}}.$$

To estimate $c_{2k+1}(n,2)$ let us consider a $(2k+1)$-uniform family on $[n]$ and join the element $(n+1)$ to each hyperedge. If the original family is $t$-cancellative, then so is the extended family. We can apply Theorem 5.1 to get

$$c_{2k+1}(n,2) \le \frac{\binom{n+1}{k+1}}{\frac{1}{2}\binom{2k+2}{k+1}}. \qquad (10)$$

Proof of Theorem 5.1. Suppose that $\mathcal{F}$ is a $2k$-uniform, 2-cancellative family with the underlying set $[n]$. In case of $|\mathcal{F}| \le 3$ the inequality holds, so we may suppose that $|\mathcal{F}| > 3$. Then $\mathcal{F}$ is 1-cancellative, too.

Define a graph $G = (V, E)$ with vertex set $V := \binom{[n]}{k}$, i.e., the family of $k$-subsets of $[n]$. A pair $A, B \in V$ forms an edge of $G$ if $A \cup B \in \mathcal{F}$. Such a pair necessarily consists of disjoint sets. Since every $F \in \mathcal{F}$ has $\frac{1}{2}\binom{2k}{k}$ partitions into $k$-sets we have

$$|E(G)| = \frac{1}{2}\binom{2k}{k}|\mathcal{F}|.$$

We claim that $|E(G)| \le |V| = \binom{n}{k}$.

Consider four adjacent edges in $G$ on five (not necessarily distinct) vertices $V_1, \dots, V_5 \in V(G)$ (in fact these are $k$-sets of $[n]$) such that $V_iV_{i+1} \in E(G)$ ($1 \le i \le 4$) and $V_i \ne V_{i+2}$. If these four edges determine four distinct sets $V_i \cup V_{i+1} \in \mathcal{F}$ then the identity

$$(V_1 \cup V_2) \cup (V_4 \cup V_5) \cup (V_2 \cup V_3) = (V_1 \cup V_2) \cup (V_4 \cup V_5) \cup (V_3 \cup V_4)$$

yields a contradiction, since $\mathcal{F}$ is 2-cancellative. By definition we have $(V_1 \cup V_2) \ne (V_2 \cup V_3) \ne (V_3 \cup V_4) \ne (V_4 \cup V_5)$. We also have $V_1 \cup V_2 \ne V_3 \cup V_4$ (and by symmetry $V_2 \cup V_3 \ne V_4 \cup V_5$). Indeed, $V_3 \subset (V_1 \cup V_2)$ together with $V_2 \cap V_3 = \emptyset$ leads to $V_1 = V_3$, which we have excluded. The last case to investigate is when $V_1 \cup V_2 = V_4 \cup V_5$ and the four edges determine exactly three sets. This leads to the contradiction

$$(V_1 \cup V_2) \cup (V_2 \cup V_3) = (V_1 \cup V_2) \cup V_3 = (V_4 \cup V_5) \cup V_3$$

$$= (V_4 \cup V_5) \cup (V_3 \cup V_4) = (V_1 \cup V_2) \cup (V_3 \cup V_4),$$

because $\mathcal{F}$ is 1-cancellative. We conclude that $G$ does not have such a sequence of four edges. Therefore $G$ contains no cycles, neither paths of length 4, implying $|E(G)| \le |V|$. □

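6 The nonuniform case, the proof of Theorem 2.1

Suppose that $\mathcal{F}$ is a 2-cancellative family of maximal size on the underlying set $[n]$. Split $\mathcal{F}$ according to the sizes of its edges, $\mathcal{F}_r := \{F \in \mathcal{F} : |F| = r\}$.

The sequence $\sqrt{2k-1}\binom{2k}{k}4^{-k}$ is monotone increasing for $k = 1, 2, 3, \dots$ so we obtain that $\binom{2k}{k}^{-1} \le 2\sqrt{2k-1} \times 4^{-k}$ for all $k \ge 1$. Using this upper bound in Theorem 5.1 we obtain

$$c_{2k}(n,2) \le \frac{\binom{n}{k}}{\frac{1}{2}\binom{2k}{k}} \le \binom{n}{k}4^{-k}\,4\sqrt{2k-1} < 4\sqrt{n} \times \binom{n}{k}4^{-k}.$$

The same inequality and (10) give

$$c_{2k+1}(n,2) \le \frac{\binom{n+1}{k+1}}{\frac{1}{2}\binom{2k+2}{k+1}} \le \binom{n+1}{k+1}4^{-k-1}\,4\sqrt{2k+1} \le 4\sqrt{n} \times \left(\binom{n}{k} + \binom{n}{k+1}\right)4^{-k-1}.$$

Finally,

$$c(n,2) = |\mathcal{F}| = \sum_r |\mathcal{F}_r| \le \sum_r c_r(n,2) = \Big(\sum_{k \ge 0} c_{2k}(n,2)\Big) + \Big(\sum_{k \ge 0} c_{2k+1}(n,2)\Big)$$

$$\le 4\sqrt{n}\Big(\sum_{k \ge 0}\binom{n}{k}4^{-k}\Big) + 4\sqrt{n}\Big(\sum_{k \ge 0}\Big(\binom{n}{k} + \binom{n}{k+1}\Big)4^{-k-1}\Big) < 9\sqrt{n}\left(\frac{5}{4}\right)^n. \qquad □$$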
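The summation in this section can be checked numerically. The sketch below is an illustration only: it adds up the uniform bounds of Theorem 5.1 (even $r$) and inequality (10) (odd $r$) and compares the total with $9\sqrt{n}\,(5/4)^n$. The function names are hypothetical, and the term for $r = 0$ is simply taken from the same formula with $k = 0$.

```python
from math import comb, sqrt

def uniform_bound(n, r):
    """Upper bound on c_r(n, 2): Theorem 5.1 for even r, inequality (10) for odd r."""
    if r % 2 == 0:
        k = r // 2
        return 2 * comb(n, k) / comb(2 * k, k)
    k = (r + 1) // 2                      # an odd r-set plus one new element has size 2k
    return 2 * comb(n + 1, k) / comb(2 * k, k)

def summed_bound(n):
    """Sum of the uniform bounds over all edge sizes r = 0, ..., n."""
    return sum(uniform_bound(n, r) for r in range(n + 1))

for n in (4, 10, 40, 100):
    # Compare the summed bound with the closed form of Theorem 2.1.
    print(n, summed_bound(n) < 9 * sqrt(n) * 1.25 ** n)
```

For small $n$ the summed bound is noticeably below the closed form, which reflects the slack in replacing $\sqrt{2k \pm 1}$ by $\sqrt{n}$.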



7 The case of t-cancellative codes, the proof of Theorem 3.1

We will define a monotone sequence $0 < \alpha_2 \le \alpha_3 \le \dots \le \alpha_t \le \cdots$ which is bounded above (by $\alpha$) such that

$$c(n,t) \le \alpha_t\, n^{(t-1)/2} \left(\frac{t+3}{t+2}\right)^n \qquad (11)$$

holds for every $n, t \ge 2$. By Theorem 2.1 this holds for $t = 2$ with $\alpha_2 := 9$. Suppose that $t \ge 3$ and (11) holds for $t - 1$. We use the following upper bound

$$c_r(n,t) \le c(n-r, t-1). \qquad (12)$$

Indeed, if $\mathcal{F}$ is an $r$-uniform $t$-cancellative family on $[n]$, then for any $F_0 \in \mathcal{F}$ the family $\{F \setminus F_0 : F \in \mathcal{F}, F \ne F_0\}$ is $(t-1)$-cancellative. Use the inequality

r(t + 2)\ 1 ( it + 2Y+y 
which holds for all integers $r \ge 1$, $t \ge 0$, and substitute $n = r(t+2)$ into (12). We obtain

For $n \ge m$ we have $c_r(n,t)\binom{n}{r}^{-1} \le c_r(m,t)\binom{m}{r}^{-1}$. We obtain that the right hand side is an upper bound for $c_r(n,t)/\binom{n}{r}$ for every $n \ge r(t+2)$. For any given $n$ and $t$ this gives

E .) s E (:) ^""-'"^(' + <- ift""-"^ - ■ 

We estimate the case $n < r(t+2)$ using (12) again



c.(n,t) < 2: c(n-r,i-l) < J] (^) 



n—r 



n/{t+2)<r<n r>n/{t+2) n-r<{t+l)n/ (t+2) 



< ai„in(*-2)/2(t + 2) 



t + 2 

TTT 



n(t+l)/(t+2) 



Here ( |±f ) < (t + 3)/(t + 2) so the sum of the above two displayed formula gives 



(t+l)/(t+2) 



^ ' ' - \VtT2 J \t + 2j 



The rest is a little calculation (e.g., we may suppose that $n > 2(t+2)^2$, otherwise our upper bound (11) for $c(n,t)$ exceeds the same upper bound for $c(n,t-1)$). □



8 Three-partite hypergraphs 

In this section we prove Theorem 4.1 on 3-uniform 2-cancellative families.

Lemma 8.1 (Erdős and Kleitman [16]) Let $\mathcal{F}$ be an $r$-uniform hypergraph. Then there exists an $r$-partite $\mathcal{F}^* \subset \mathcal{F}$ with $|\mathcal{F}^*| \ge \frac{r!}{r^r}|\mathcal{F}|$. □

Proof of Theorem 4.1. Suppose that $\mathcal{H}$ is a 3-uniform $G(7,4)$-sparse family with vertex set $[n]$. We claim that there exists a subfamily $\mathcal{H}'' \subset \mathcal{H}$ such that

$$|\mathcal{H}''| \ge |\mathcal{H}| - \frac{2}{5}n \quad \text{and } \mathcal{H}'' \text{ is linear.}$$

First, note that if the hyperedge $F \in \mathcal{H}$ has two other edges $F_1, F_2$ with $|F \cap F_i| = 2$, then these three edges form a separate connected component of $\mathcal{H}$ on 5 vertices. Let $\mathcal{H}'$ be the hypergraph obtained from $\mathcal{H}$ after deleting two edges from each such 5-vertex component. If $F_1 \in \mathcal{H}'$ and there exists an $F_2 \in \mathcal{H}'$ with $|F_1 \cap F_2| = 2$ then this $F_2$ is unique. Moreover, if $|F_1 \cap F_2| = 2$ and $|F_3 \cap F_4| = 2$ hold for four distinct sets, then $F_1 \cup F_2$ is disjoint from $F_3 \cup F_4$. Leave out an edge from each such pair to obtain $\mathcal{H}''$, which is a linear hypergraph by definition, and we left out at most $(2/5)n$ edges of $\mathcal{H}$.

Second, observe that a linear, 3-uniform, $G(7,4)$-sparse family $\mathcal{H}''$ is 2-cancellative. Indeed, if $X := C \setminus (A \cup B) = D \setminus (A \cup B)$ for four distinct members $\{A, B, C, D\} \subset \mathcal{H}''$, then $X \subset C \cap D$ so $|X| \le 1$, and $|A \cup B| \le 6$, so they form a $G(7,4)$ family, a contradiction. If we take $|\mathcal{H}|$ as large as possible, then we complete the proof of the first inequality of (8) as follows

$$c_3(n,2) \ge |\mathcal{H}''| \ge |\mathcal{H}| - \frac{2}{5}n = f_3(n,7,4) - \frac{2}{5}n.$$

Next, let $\mathcal{F}$ be a 3-uniform, 2-cancellative family on $n$ vertices. We claim that there exists a subfamily $\mathcal{F}' \subset \mathcal{F}$ such that

$$|\mathcal{F}'| \ge |\mathcal{F}| - n \quad \text{and } \mathcal{F}' \text{ is linear.}$$

Indeed, leave out a hyperedge from $\mathcal{F}$ if it has a vertex of degree one. Repeat this until we get $\mathcal{F}' \subset \mathcal{F}$ in which every degree is either 0 or at least 2. We claim that $\mathcal{F}'$ is linear (in case of $|\mathcal{F}'| \ge 4$). Suppose not: $|F_1 \cap F_2| = 2$, $F_1, F_2 \in \mathcal{F}'$, and $x_i$ is the unique element of $F_i \setminus (F_1 \cap F_2)$. By our degree condition there exist $A_i \in \mathcal{F}'$ with $x_i \in A_i$, $A_i \ne F_1$ and $A_i \ne F_2$. This leads to the contradiction $A_1 \cup A_2 \cup F_1 = A_1 \cup A_2 \cup F_2$. (The case $|\mathcal{F}'| = 3$ is left to the reader.)

Apply Lemma 8.1 to $\mathcal{F}'$ to obtain a 3-partite $\mathcal{F}^*$ of size at least $(2/9)|\mathcal{F}'|$. We claim it is $G(7,4)$-sparse, because every 3-partite, 2-cancellative, linear family $\mathcal{F}^*$ is $G(7,4)$-sparse. Indeed, take any four distinct members $\{A, B, C, D\} \subset \mathcal{F}^*$. If $A \cap B = \emptyset$ then $C \setminus (A \cup B) \ne \emptyset$ (by linearity) and it is not equal to $D \setminus (A \cup B)$ (by 2-cancellativeness), so the union of the four of them has at least 8 vertices. Otherwise they pairwise have a one-element intersection. If there is a vertex of degree three, say $A \cap B \cap C = \{x\}$, then linearity and 3-partiteness imply that $D$ is not covered by $A \cup B \cup C$, so again their union has eight vertices (at least). If these four triples pairwise meet but their maximum degree is two, then $C$ and $D$ cover $(A \cup B) \setminus (A \cap B)$, and they have a single common vertex outside $A \cup B$, yielding the contradiction $A \cup B \cup C = A \cup B \cup D$.

Finally, if we take $|\mathcal{F}|$ as large as possible, then we complete the proof of the second inequality of (8) as follows

$$f_3(n,7,4) \ge |\mathcal{F}^*| \ge \frac{2}{9}|\mathcal{F}'| \ge \frac{2}{9}(|\mathcal{F}| - n) = \frac{2}{9}(c_3(n,2) - n). \qquad □$$

Define the hypergraphs $G_6$ and $G_7$ as follows on 6 and 7 vertices.

$$E(G_6) := \{123, 156, 426, 453\},$$
$$E(G_7) := \{123, 456, 726, 753\}.$$

Note that both are three-partite and the 3-partition of their vertices is unique. We have proved the following fact.

Proposition 8.2 Suppose that $\mathcal{F}$ is a three-partite, linear hypergraph. It is 2-cancellative if and only if it avoids $G_6$ and $G_7$. It is $G(7,4)$-sparse if and only if it avoids $G_6$ and $G_7$. □
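The two forbidden configurations are small enough to inspect by machine. The sketch below is an illustration, not part of the proof: it checks that $G_6$ and $G_7$ are linear, that they span 6 and 7 vertices, and that each contains a quadruple with $A \cup B \cup C = A \cup B \cup D$. The helper names are hypothetical.

```python
from itertools import combinations

G6 = [{1, 2, 3}, {1, 5, 6}, {4, 2, 6}, {4, 5, 3}]
G7 = [{1, 2, 3}, {4, 5, 6}, {7, 2, 6}, {7, 5, 3}]

def is_linear(edges):
    """Any two edges share at most one vertex."""
    return all(len(e & f) <= 1 for e, f in combinations(edges, 2))

def violating_quadruple(edges):
    """Return (A, B, C, D) with A u B u C == A u B u D, or None if none exists."""
    for a, b in combinations(edges, 2):
        others = [e for e in edges if e is not a and e is not b]
        for c, d in combinations(others, 2):
            if a | b | c == a | b | d:
                return a, b, c, d
    return None

for name, g in (("G6", G6), ("G7", G7)):
    vertices = set().union(*g)
    print(name, len(vertices), is_linear(g), violating_quadruple(g))
```

Both hypergraphs have four edges on at most seven vertices, so they are exactly the kind of configuration a $G(7,4)$-sparse family must avoid.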



9 A construction by induced packings 

According to the upper bounds in Theorem 5.1 and (10) we have

$$c_2(n,2) \le n, \qquad c_3(n,2) \le \frac{1}{6}n(n+1), \qquad c_4(n,2) \le \frac{1}{6}n(n-1).$$

Obviously, $c_2(n,2) = n - 1$ for $n > 3$. The second inequality, although it is close to the true order of magnitude, is not sharp if Erdős' conjecture is true, see (6), (7), and (8). Any 4-uniform Steiner system $S(n,4,2)$ is 2-cancellative, yielding the lower bound $c_4(n,2) \ge \frac{1}{12}n(n-1) - O(n)$ for all $n$.
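The statement about Steiner systems can be illustrated on the smallest nontrivial case. The sketch below builds an $S(13,4,2)$ from the cyclic translates of $\{0,1,3,9\}$ modulo 13 (a standard perfect difference set; this particular example is an assumption of the sketch, not taken from the paper) and verifies by brute force that it is 2-cancellative.

```python
from itertools import combinations

n = 13
base = (0, 1, 3, 9)  # perfect difference set modulo 13
blocks = [frozenset((b + i) % n for b in base) for i in range(n)]

def is_steiner_pair_system(blocks, n):
    """Every pair of points lies in exactly one block."""
    return all(sum(1 for blk in blocks if {u, v} <= blk) == 1
               for u, v in combinations(range(n), 2))

def is_2_cancellative(family):
    """No distinct A, B, C, D with A u B u C == A u B u D."""
    for a, b in combinations(family, 2):
        seen = set()
        for c in family:
            if c is a or c is b:
                continue
            union = a | b | c
            if union in seen:      # two different third sets give the same union
                return False
            seen.add(union)
    return True

print(len(blocks), n * (n - 1) // 12)        # number of blocks vs n(n-1)/12
print(is_steiner_pair_system(blocks, n))
print(is_2_cancellative(blocks))
```

Here the family has $n(n-1)/12 = 13$ members, while the upper bound $n(n-1)/6$ equals 26.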

Theorem 9.1 $c_4(n,2) = \frac{1}{6}n^2 - o(n^2)$.

Theorem 9.2 $c_{2k}(n,2) \ge \left(\frac{n}{2k}\right)^k - o(n^k)$.

The proof of Theorem 9.2 is postponed to Section 10.3. For the construction giving Theorem 9.1 we use induced packings of graphs.

A set of graphs $\mathcal{P} := \{G_1 = (V_1, E_1), G_2 = (V_2, E_2), \dots\}$ is called a packing if they are edge-disjoint subgraphs of $G = (V, E)$ (by definition $V_i \subset V$ for each $i$). The packing $\mathcal{P}$ is called an induced packing if $G$ restricted to $V_i$ is exactly $G_i$ (for all $i$). The induced packing $\mathcal{P}$ is called an almost disjoint induced packing into the graph $G$ if $|V_i \cap V_j| \le 2$ (for all $i \ne j$). It follows that whenever $F = V_i \cap V_j$, $|F| = 2$, then $F$ is not an edge of $G$. In short, any two induced graphs $G[V_i]$ and $G[V_j]$ are either vertex disjoint, or share one vertex, or meet in a non-edge. For example, if $V$ is an $n$-element set, $n$ is even, $V = A_1 \cup A_2 \cup \dots \cup A_{n/2}$ where each $|A_i| = 2$, and $G$ is the complete graph on $V$ minus the $n/2$ edges of the perfect matching $\{A_1, A_2, \dots\}$, then $E(G)$ can be decomposed into $n(n-2)/8$ almost disjoint, induced four-cycles, namely those induced by $A_i \cup A_j$.
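The four-cycle example can be verified directly for a small even $n$. The sketch below is an illustration under the assumption that the pairs are $A_i = \{2i, 2i+1\}$; the function names are hypothetical.

```python
from itertools import combinations

def four_cycle_packing(n):
    """Pairs A_i = {2i, 2i+1}; G is K_n minus the perfect matching {A_i}.
    Returns the vertex sets A_i u A_j and the edge set of G."""
    assert n % 2 == 0
    pairs = [(2 * i, 2 * i + 1) for i in range(n // 2)]
    matching = {frozenset(p) for p in pairs}
    edges = {frozenset(e) for e in combinations(range(n), 2)} - matching
    blocks = [frozenset(p + q) for p, q in combinations(pairs, 2)]
    return blocks, edges

def induced_edges(block, edges):
    """Edges of G with both endpoints inside the block."""
    return {e for e in edges if e <= block}

n = 8
blocks, edges = four_cycle_packing(n)
cycles = [induced_edges(b, edges) for b in blocks]

print(len(blocks) == n * (n - 2) // 8)                   # number of four-cycles
print(all(len(c) == 4 for c in cycles))                  # each block induces 4 edges
print(all(sum(v in e for e in c) == 2                    # every vertex has degree 2
          for b, c in zip(blocks, cycles) for v in b))
print(sum(len(c) for c in cycles) == len(edges))         # the cycles partition E(G)
print(all(len(b1 & b2) <= 2 and (b1 & b2) not in edges   # blocks meet in a non-edge
          for b1, b2 in combinations(blocks, 2)))
```

Two blocks $A_i \cup A_j$ and $A_i \cup A_l$ share exactly the pair $A_i$, which is a removed matching edge, so the packing is almost disjoint in the sense defined above.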

Let $H$ be a graph of $e$ edges and let $i(n,H)$ denote the maximum number of almost disjoint induced copies of $H$ that can be packed into any $n$-vertex graph. It was proved by Frankl and the present author that

Lemma 9.3 [22] For any fixed graph $H$ with $e$ edges one can delete $o(n^2)$ edges of the complete graph $K_n$ such that the rest of the edges, the graph $L_n = L_n(H)$, can be decomposed into $(1 - o(1))\binom{n}{2}/e$ almost disjoint induced copies of $H$. □

Proof of Theorem 9.1. The graph $H_k$, for $k \ge 3$, is defined as a complete graph $K_k$ and $\binom{k}{3}$ vertices of degree three, each of those connected to a different triple of $V(K_k)$. We have that $H_k$ has $k + \binom{k}{3}$ vertices, $\binom{k}{2} + 3\binom{k}{3}$ edges, and it contains $\binom{k}{3}$ special $K_4$'s, those having a vertex of degree 3. Take any almost disjoint packing of copies of $H_k$, $\mathcal{P} := \{H_k^1, H_k^2, \dots\}$, and define a 4-uniform family $\mathcal{F}(\mathcal{P})$ as the vertex sets of the special $K_4$'s. Obviously $|\mathcal{F}(\mathcal{P})| = \binom{k}{3}|\mathcal{P}|$.
We claim that $\mathcal{F}(\mathcal{P})$ is a 2-cancellative family.

Suppose, on the contrary, that there are four distinct members $A, B, C, D \in \mathcal{F}$ with $A \cup B \cup C = A \cup B \cup D$. Note that $|F \cap F'| \le 2$ for $F, F' \in \mathcal{F}$. Furthermore, in the case of equality $F$ and $F'$ are generated by the same $H_k^i$.

Consider first the case when $C$ and $D$ are generated by the same copy $H_k^i$: $C = \{c, x_1, x_2, x_3\}$, $D = \{d, y_1, y_2, y_3\}$, where $c$ and $d$ are distinct 3-degree vertices of $H_k^i$. The element $c$ is covered by $A \cup B$, say $c \in A$. By definition, the pairs $cx_1, cx_2, cx_3$ are only covered by $C$ among the members of $\mathcal{F}$, and since there is no edge (of any $H_k^j$) from $c$ to $D \setminus C$, those pairs are not covered by any member $F \in \mathcal{F}$. Hence $A \cap (C \cup D) = \{c\}$. Similarly, $B \cap (C \cup D) = \{d\}$. Hence $(A \cup B) \cap (C \cup D) = \{c, d\}$. Since the symmetric difference $C \triangle D$ is contained in $A \cup B$, we obtain that it is $\{c, d\}$. This leads to the contradiction $|C \cap D| = 3$.

Consider now the other case, that $i \ne j$ and $C = \{c, x_1, x_2, x_3\}$ is generated by $H_k^i$, where $c$ is a 3-degree vertex of $H_k^i$, and $D = \{d, y_1, y_2, y_3\}$ is generated by $H_k^j$, where $d$ is a 3-degree vertex of $H_k^j$. Since they come from different copies of $H_k$ we have $|C \cap D| \le 1$. This implies that either $A$ or $B$ meets $C$ in two vertices, say $|A \cap C| = 2$. Then $A$ is generated by $H_k^i$ as $C$ is, say $A = \{a, x_2, x_3, x_4\}$. It follows that $|A \cap D| \le 1$, so $|B \cap D| = 2$ and $B$ is generated by $H_k^j$ as well. The pair $\{x_1, c\} = C \setminus A$ is covered by $B \cup D$, thus it is covered by $V(H_k^j)$. This leads to the contradiction that $V(H_k^i) \cap V(H_k^j)$ contains an edge of $H_k^i$. This completes the proof that $\mathcal{F}$ is 2-cancellative.

For given $n$, taking a large induced packing of $H_k$'s, Lemma 9.3 implies that

$$c_4(n,2) \ge i(n, H_k)\binom{k}{3} \ge (1 - o(1))\frac{\binom{k}{3}}{\binom{k}{2} + 3\binom{k}{3}}\binom{n}{2} \qquad (13)$$

when $k$ is fixed and $n \to \infty$. Let $\pi_4 := \liminf_{n \to \infty}\{c_4(n,2)/\binom{n}{2}\}$. The lower bound (13) gives that

$$\pi_4 \ge \frac{\binom{k}{3}}{\binom{k}{2} + 3\binom{k}{3}}.$$

Since this holds for each $k$ we obtain $\pi_4 \ge 1/3$. Finally, $\pi_4 \le 1/3$ was proved in Theorem 5.1, completing the proof of $\pi_4 = 1/3$. □
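The passage from the ratio in the lower bound to the constant $1/3$ is a one-line simplification. The following worked computation uses only the vertex and edge counts of $H_k$ stated in the proof.

```latex
\frac{\binom{k}{3}}{\binom{k}{2} + 3\binom{k}{3}}
  = \frac{\frac{k(k-1)(k-2)}{6}}{\frac{k(k-1)}{2}\bigl(1 + (k-2)\bigr)}
  = \frac{k-2}{3(k-1)}
  \longrightarrow \frac{1}{3} \qquad (k \to \infty).
```

For instance, $k = 3$ gives $1/6$ and $k = 10$ gives $8/27$, so the bound approaches $1/3$ from below as the packed graph $H_k$ grows.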



10 The lower bound for the 2k-uniform case



10.1 Not vanishing polynomials 

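For a set of variables $X = \{x_1, \dots, x_s\}$ and $0 \le i \le s$ the symmetric polynomial $\sigma_i(X)$ is defined as $\sum_{I \subset X,\, |I| = i} \prod_{x \in I} x$, with $\sigma_0(X) = 1$. For convenience, frequently $\sigma_i(X)$ is defined to be 0 for $|X| < i$ (and for $i < 0$). Suppose that $X_1, X_2, \dots, X_\ell$ are disjoint sets of variables, $|X_j| = t_j$, $0 \le t_j < k$, $\sum_j (k - t_j) = k$. The entries of a row of the $k \times k$ matrix $M(X_1, \dots, X_\ell)$ consist of a block with the symmetric polynomials $\{\sigma_0(X_j), \sigma_1(X_j), \dots, \sigma_{t_j}(X_j)\}$ and 0's otherwise. The rows are distinct, so these blocks are shifted in all possible $k - t_j$ ways.

$$M(X_1, \dots, X_\ell) :=
\begin{pmatrix}
1 & \sigma_1(X_1) & \sigma_2(X_1) & \cdots & \sigma_{t_1}(X_1) & 0 & \cdots & 0\\
0 & 1 & \sigma_1(X_1) & \sigma_2(X_1) & \cdots & \sigma_{t_1}(X_1) & \cdots & 0\\
\vdots & & \ddots & & & & \ddots & \vdots\\
1 & \sigma_1(X_2) & \sigma_2(X_2) & \cdots & \sigma_{t_2}(X_2) & 0 & \cdots & 0\\
\vdots & & \ddots & & & & \ddots & \vdots\\
0 & \cdots & 0 & 1 & \sigma_1(X_\ell) & \sigma_2(X_\ell) & \cdots & \sigma_{t_\ell}(X_\ell)
\end{pmatrix}$$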

Fact 10.1 The polynomial $\det M(X_1, \dots, X_\ell)$ of $\sum_j |X_j|$ variables is not vanishing.

Proof. Over any field we can substitute only 1's and 0's such that the matrix $M$ becomes a lower triangular matrix, having only 1's in the main diagonal and 0's above. Namely, let $x = 0$ for each $x \in X_1$. In the second block of $M$, in rows $(k - t_1) + 1$ to $(k - t_1) + (k - t_2)$, only $\sigma_i(X_2)$ stands in the main diagonal, where $i = k - t_1$. Define $k - t_1$ variables of $X_2$ to be 1, the rest 0.



Z. FuREDi: 2-cancellative families 13 

In general, in the $j$'th block, in rows $(k - t_1) + \dots + (k - t_{j-1}) + 1$ to $\sum_{1 \le s \le j}(k - t_s)$, define the variables of $X_j$ in such a way that $(k - t_1) + \dots + (k - t_{j-1})$ of them are 1's and the rest are 0's. This can be done, since $(k - t_1) + \dots + (k - t_{j-1}) \le t_j$. □

One can define the matrix in a more general setting when the blocks consist of rows of the form $(\sigma_{m+1}(X_j), \sigma_{m+2}(X_j), \dots, \sigma_{m+k}(X_j))$. We can get, e.g., that the determinant of the $k \times k$ matrix $M$ with $M_{ij} := \sigma_{m+i+j-2}(X)$ is not vanishing if $m \ge 0$, $|X| \ge m + k - 1$.

Let $q > 1$ be a power of a prime, $\mathbf{F} := \mathbf{F}_q$ the finite field of size $q$. For any polynomial $p(x_1, \dots, x_s)$ over this field the zero set $Z(p)$ is defined as $Z := \{(x_1, \dots, x_s) \in \mathbf{F}^s : p(x_1, \dots, x_s) = 0\}$. It is well-known that

Fact 10.2 There is an $\eta := \eta(h,s)$ such that there is an upper bound for the size of the zero-set in $\mathbf{F}_q$ of any polynomial $p(x_1, \dots, x_s)$ of $s$ variables and degree $h$ that is not identically 0:

$$|Z(p(x_1, \dots, x_s))| \le \eta(h,s)\,q^{s-1}. \qquad □$$
10.2 A lemma on independent polynomials 

Let $k$ be a positive integer, let $\mathcal{P} := \mathcal{P}_{<k}[\mathbf{F}, x]$ be the set of polynomials of degree at most $k - 1$

$$\mathcal{P}_{<k} := \{a_0 + a_1x + \dots + a_{k-1}x^{k-1} : a_i \in \mathbf{F}\}.$$

The number of such polynomials is $q^k$ and they form a linear space of dimension $k$ over $\mathbf{F}$. A set of polynomials $p_1(x), \dots, p_\ell(x) \in \mathcal{P}$ is called $(k_1, \dots, k_\ell)$-independent, where $k_1, \dots, k_\ell$ are positive integers, if

$$f_1(x)p_1(x) + \dots + f_\ell(x)p_\ell(x) = 0$$

and $\deg(f_i) < k_i$ for all $i$ imply that each $f_i(x)$ is the 0 polynomial. Equivalently, all the $q^{\sum k_i}$ polynomials of the form $\sum f_ip_i$ (with $\deg(f_i) < k_i$) are distinct. The case when all $k_i = 1$ corresponds to the usual linear independence. To stay in the space $\mathcal{P}_{<k}$ we also suppose that $\deg(p_i) + k_i \le k$. Then necessarily $\sum_i k_i \le k$.

For $Z \subset \mathbf{F}$ there is a unique polynomial with leading coefficient 1 and roots $Z$, namely

$$p_Z(x) := \prod_{z \in Z}(x - z).$$

Suppose that $\ell \ge 2$, $k_1, \dots, k_\ell$ are positive integers with $k_1 + \dots + k_\ell = k$ and let $x_1, \dots, x_{(\ell-1)k}$ be a sequence of elements of $\mathbf{F}_q$. Define the (multi)sets $X_i$ of size $k - k_i$ as intervals of this sequence, $X_1 := \{x_s : 1 \le s \le k - k_1\}$, in general $X_j := \{x_s : \sum_{i<j}(k - k_i) < s \le \sum_{i \le j}(k - k_i)\}$.

Lemma 10.3 The polynomials $p_{X_1}(x), \dots, p_{X_\ell}(x)$ are $(k_1, \dots, k_\ell)$-independent for all but at most $O(q^{(\ell-1)k-1})$ sequences.



Proof. We only consider the sequences with distinct elements. There are at most $\binom{(\ell-1)k}{2}q^{(\ell-1)k-1}$ sequences with repeated entries.

The polynomials $p_1, \dots, p_\ell$ are $(k_1, \dots, k_\ell)$-independent if and only if the polynomials $\{x^ip_j(x) : 0 \le i < k_j,\ 1 \le j \le \ell\}$ are linearly independent. These $k$ polynomials are linearly independent if and only if their coefficient matrix is nonsingular. The coefficients of $p_{X_j}$ are the values of the symmetric polynomials $\sigma_s(-X_j)$, so the coefficient matrix is exactly $M(-X_1, \dots, -X_\ell)$ defined in the previous subsection. According to Fact 10.1 its determinant is a nonzero polynomial of degree at most $0 + 1 + 2 + \dots + (k-1)$ and with $(\ell - 1)k$ variables. Then Fact 10.2 gives an upper bound for the number of sequences whose determinants vanish. □

Corollary 10.4 For every $k$ there exists a $q_0(k)$ such that if $q > q_0(k)$ then there exists a $2k$-element set $S \subset \mathbf{F}_q$ such that the polynomials

$$p_X(x),\ p_Y(x),\ p_W(x) \text{ are } (k - |X|, k - |Y|, k - |W|)\text{-independent}$$

for every partition $S = X \cup Y \cup W$, $|X| + |Y| + |W| = 2k$, $1 \le |X|, |Y|, |W| < k$.

In fact, applying the previous Lemma with $\ell = 3$ we can see that almost all $2k$-sets, all but at most $O(q^{2k-1})$ of them, have this total independence property. □



10.3 The algebraic construction yielding Theorem 9.2

Let $q$ be the largest prime power not exceeding $n/(2k)$. Since there are no too large gaps among primes we have $q > n/(2k) - O(n^{5/8})$. We also suppose that $q > q_0(k)$ used in Corollary 10.4. We are going to define a 2-cancellative, $2k$-uniform family $\mathcal{F}$ of size $q^k$.

Take a set $S \subset \mathbf{F}_q$ of size $2k$ satisfying Corollary 10.4. Our hypergraph $\mathcal{F} := \mathcal{F}(q, S)$ consists of the graphs of the polynomials $\mathcal{P}_{<k}$ restricted to $S$: the vertex set is $V(\mathcal{F}) := S \times \mathbf{F} = \{(s,y) : s \in S, y \in \mathbf{F}\}$, every $p \in \mathcal{P}$ defines a set $F(p) := \{(s, p(s)) : s \in S\}$, and we let $\mathcal{F} := \{F(p) : p \in \mathcal{P}\}$.
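The construction can be tried out on a toy instance. The sketch below is an illustration only, under two simplifying assumptions that are not from the paper: $q$ is a prime, so arithmetic modulo $q$ is field arithmetic, and $k = 2$, where any 4-element $S$ works because two distinct polynomials of degree at most 1 agree in at most one point. The function names are hypothetical.

```python
from itertools import combinations, product

def polynomial_family(q, k, S):
    """Sets F(p) = {(s, p(s)) : s in S} for all polynomials p of degree < k
    over Z_q (q prime), with p given by its coefficient tuple."""
    family = []
    for coeffs in product(range(q), repeat=k):
        family.append(frozenset(
            (s, sum(a * pow(s, i, q) for i, a in enumerate(coeffs)) % q)
            for s in S))
    return family

def is_2_cancellative(family):
    """No distinct A, B, C, D with A u B u C == A u B u D."""
    for a, b in combinations(family, 2):
        seen = set()
        for c in family:
            if c is a or c is b:
                continue
            union = a | b | c
            if union in seen:
                return False
            seen.add(union)
    return True

q, k, S = 5, 2, (0, 1, 2, 3)
family = polynomial_family(q, k, S)
print(len(family) == q ** k)                    # q^k members
print(all(len(f) == 2 * k for f in family))     # the family is 2k-uniform
print(is_2_cancellative(family))
```

For $k \ge 3$ the choice of $S$ matters, which is where Corollary 10.4 enters; the brute-force check above does not replace that argument.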

To show that $\mathcal{F}$ is 2-cancellative suppose, on the contrary, that $A, B, C$, and $D$ are four distinct members of $\mathcal{F}$ with $A \cup B \cup C = A \cup B \cup D$. There are four distinct polynomials $a(x), b(x), c(x)$, and $d(x) \in \mathcal{P}$ generating these sets, $A = F(a)$, $B = F(b)$, etc.

Let $W \subset S$ be the set of coordinates where $C$ and $D$ meet, $W := \{s \in S : c(s) = d(s)\}$. Let $X := \{x \in S \setminus W : c(x) = a(x)\}$, and let $Y := S \setminus (X \cup W)$. For $x \in X$ the element $(x, d(x))$ is covered neither by $C$ nor by $A$, so it must belong to $B$, $b(x) = d(x)$. For $y \in Y$ we have $c(y) \ne d(y)$, $c(y) \ne a(y)$, so $(y, c(y))$ must be in $B$, $c(y) = b(y)$. Considering the same $y \in Y$, the element $(y, d(y))$ is covered neither by $C$ nor by $B$, so it must belong to $A$, $a(y) = d(y)$. Let us summarize: there exists a partition $S = W \cup X \cup Y$ such that

$$c(w) = d(w) \text{ for } w \in W, \qquad (14)$$

$$c(x) \ne d(x) \text{ for } x \in X, \text{ but } c(x) = a(x) \text{ and } d(x) = b(x), \qquad (15)$$

$$c(y) \ne d(y) \text{ for } y \in Y, \text{ but } d(y) = a(y) \text{ and } c(y) = b(y). \qquad (16)$$



Since $c$ and $d$ are distinct polynomials of degree at most $k - 1$ we have $|W| < k$. Similarly $|X|, |Y| < k$. These also imply that $|X|, |Y|, |W| \ge 2$ (and thus $k \ge 3$).

By (14), $c - d$ is divisible by $p_W$, so there exists a polynomial $c_1(x)$ such that

$$c = d + c_1p_W \text{ where } c_1 \in \mathcal{P}, \text{ and } \deg(c_1) < k - |W|.$$

The first halves of (15) and (16) similarly imply that

$$a = c + a_1p_X \text{ where } a_1 \in \mathcal{P}, \text{ and } \deg(a_1) < k - |X|,$$
$$d = a + a_2p_Y \text{ where } a_2 \in \mathcal{P}, \text{ and } \deg(a_2) < k - |Y|.$$

Adding these three equations we obtain

$$0 = c_1p_W + a_1p_X + a_2p_Y.$$

Then the independence of $p_X$, $p_Y$ and $p_W$ implies $c_1 = a_1 = a_2 = 0$, a contradiction. □

One might think that if we use the second halves of (15) and (16) then we have more constraints and maybe we do not really need independence and Corollary 10.4. In fact, independence is essential. The second halves only imply that $b = d - a_1p_X = c + a_2p_Y$, so $\mathcal{F}$ can have many non-2-cancellative fourtuples if $S$ is not chosen properly.

11 A remark on 1-cancellative uniform families 

An $r$-partite hypergraph is cancellative: it contains no three distinct edges with $A \cup B = A \cup C$. Considering the complete $r$-partite hypergraph on $n$ vertices with almost equal parts one gets

$$c_r(n) \ge \left\lfloor \frac{n}{r} \right\rfloor \times \left\lfloor \frac{n+1}{r} \right\rfloor \times \dots \times \left\lfloor \frac{n+r-1}{r} \right\rfloor =: p_r(n). \qquad (17)$$

The right hand side is exactly $(n/r)^r$ when $r$ divides $n$. An old result of Mantel, about the maximum size of triangle-free graphs, gives $c_2(n) = p_2(n) = \lfloor n^2/4 \rfloor$. Katona [27] conjectured and Bollobás [6] proved that $c_3(n) = p_3(n)$. Bollobás also conjectured that equality holds in (17) for all $n \ge r \ge 4$ as well. This was established for $2r \ge n \ge r$ in [20]. Sidorenko [40] proved Bollobás' conjecture for $r = 4$. (There is a recent refinement of this by Pikhurko [34].) However, Shearer [39] gave a counterexample. His result implies that there exist an $\varepsilon > 0$ and $n_0(r)$ such that $c_r(n) > (1 + \varepsilon)^r(n/r)^r$ for $n > n_0(r)$, $r \ge 11$. The cases $5 \le r \le 10$ are still undecided.
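The product $p_r(n)$ in (17) is easy to evaluate. The short sketch below (the function name `p` is an assumption of this illustration) computes it and checks the two special cases mentioned in the text: $r$ dividing $n$, and $r = 2$.

```python
from math import prod

def p(r, n):
    """Number of edges of the complete r-partite r-uniform hypergraph on n
    vertices with almost equal parts: the product of floor((n + i) / r)."""
    return prod((n + i) // r for i in range(r))

print(p(3, 12) == (12 // 3) ** 3)                         # r divides n
print(all(p(2, n) == n * n // 4 for n in range(2, 50)))   # Mantel's bound
print([p(4, n) for n in range(4, 10)])                    # part sizes grow one at a time
```

The part sizes $\lfloor (n+i)/r \rfloor$, $0 \le i < r$, always sum to $n$, which is why the product counts the edges of a balanced $r$-partition.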

It was observed in [20] that Cr{n) = 2"'"'' for 2r > n > r and if is a cancellative family of r-sets 
from an n-set and n > 2r, then 



Here we show an almost matching lower bound 



Theorem 11.1 For every $n \ge r \ge 2$

\[ c_r(n) \ge \frac{c_0}{2^r} \binom{n}{r}, \]

where $c_0 := \prod_{k \ge 1} (1 - 2^{-k}) = .2887\ldots$



This result is basically due to Tolhuizen [H], although he was not interested in r-uniform hypergraphs
and wrote it like "the rate of a cancellative code is $\log_2 3 - 1 = .5849\ldots$". His publication is not even
reviewed in MathSciNet so we briefly describe his construction.

Proof. If $M$ is a random $m \times m$ matrix with entries from the two-element field $\mathbb{F}_2 = \{0, 1\}$, then

\[ \mathrm{Prob}(M \text{ is nonsingular}) = \frac{2^m - 1}{2^m} \times \frac{2^m - 2}{2^m} \times \cdots \times \frac{2^m - 2^{m-1}}{2^m} > c_0. \]
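
The probability above and the constant $c_0 = \prod_{k \ge 1}(1 - 2^{-k})$ are easy to check numerically. The Python sketch below is only an illustration: `prob_nonsingular` evaluates the finite product exactly with rational arithmetic, and `c0` approximates the infinite product by truncating it after a number of factors (the default of 200 is an arbitrary choice of this sketch).

```python
from fractions import Fraction


def prob_nonsingular(m):
    """Exact probability that a uniformly random m x m matrix over F_2 is nonsingular."""
    prob = Fraction(1)
    for i in range(m):
        # Row i+1 must avoid the 2^i vectors spanned by the previous i rows.
        prob *= Fraction(2**m - 2**i, 2**m)
    return prob


def c0(terms=200):
    """Floating point approximation of prod_{k>=1} (1 - 2^(-k)), truncated."""
    value = 1.0
    for k in range(1, terms + 1):
        value *= 1.0 - 2.0 ** (-k)
    return value


if __name__ == "__main__":
    print(c0())
    for m in (1, 2, 3, 10):
        print(m, float(prob_nonsingular(m)))
```

For example, `prob_nonsingular(2)` equals 3/8, matching the 6 invertible matrices among the 16 binary $2 \times 2$ matrices.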

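Considering $(n - r) \times n$ random matrices we obtain an $(n - r) \times n$ matrix $A$ (over $\mathbb{F}_2$) containing
at least $c_0 \binom{n}{n-r}$ nonsingular $(n - r) \times (n - r)$ submatrices. Let $\mathcal{F}$ be the set of those r-sets $F \subset [n]$
where the columns of $A$ labeled by the elements of $[n] \setminus F$ have full rank. We have $|\mathcal{F}| \ge c_0 \binom{n}{r}$.

Let $S$ be the $(n - r)$-dimensional subspace generated by the rows of $A$ in $\mathbb{F}_2^n$ and let $R$ be a
subspace of dimension $r$ such that $S + R$ is the whole space. Decompose the n-dimensional space
into $2^r$ disjoint affine subspaces

\[ \mathbb{F}_2^n = \bigcup_{v \in R} (S + v). \]

For any set $F \subset [n]$ let $\overline{F}$ be the 0-1 vector with support $F$. For each $v \in R$ let

\[ \mathcal{F}(v) := \{F : F \in \mathcal{F},\ \overline{F} \in (S + v)\}. \]

We have partitioned $\mathcal{F}$ into $2^r$ pairwise disjoint r-uniform families. Given any $F \in \mathcal{F}$ the vectors of
$(S + v)$ truncated to $([n] \setminus F)$ are all distinct. Hence each $\mathcal{F}(v)$ is a cancellative family: if
$F, G, H \in \mathcal{F}(v)$ and $F \cup G = F \cup H$, then $\overline{G}$ and $\overline{H}$ agree on $[n] \setminus F$, so $G = H$. The largest
$\mathcal{F}(v)$ has at least $c_0 \binom{n}{r} / 2^r$ members. □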
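
The construction in the proof can be tried out on small parameters. The following Python sketch is an illustration under our own conventions, not Tolhuizen's original presentation: vectors of $\mathbb{F}_2^n$ are stored as integer bitmasks, the helper names (`xor_basis`, `reduce_mod`, `tolhuizen_classes`) are hypothetical, and cosets of the row space $S$ are identified by a canonical reduced representative instead of an explicit complement subspace.

```python
import random
from itertools import combinations


def xor_basis(vectors):
    """Echelon basis (distinct leading bits, sorted descending) of the F_2-span."""
    basis = []
    for v in vectors:
        for b in basis:
            v = min(v, v ^ b)  # clears the leading bit of b in v if it is set
        if v:
            basis.append(v)
            basis.sort(reverse=True)
    return basis


def reduce_mod(v, basis):
    """Canonical representative of the coset v + span(basis)."""
    for b in basis:
        v = min(v, v ^ b)
    return v


def is_cancellative(masks):
    """True if no three distinct members A, B, C satisfy A | B == A | C."""
    for a in masks:
        unions = [a | b for b in masks if b != a]
        if len(unions) != len(set(unions)):
            return False
    return True


def tolhuizen_classes(n, r, rng):
    """Split the r-sets with a full-rank complement into cosets of the row space."""
    rows = [rng.getrandbits(n) for _ in range(n - r)]  # random (n-r) x n matrix
    basis = xor_basis(rows)
    full = (1 << n) - 1
    classes = {}
    for subset in combinations(range(n), r):
        f = sum(1 << i for i in subset)
        complement = full ^ f
        # Columns outside the r-set must form a nonsingular square submatrix.
        if len(xor_basis([row & complement for row in rows])) == n - r:
            classes.setdefault(reduce_mod(f, basis), []).append(f)
    return classes


if __name__ == "__main__":
    classes = tolhuizen_classes(9, 3, random.Random(0))
    sizes = sorted(len(c) for c in classes.values())
    print(sizes, all(is_cancellative(c) for c in classes.values()))
```

Every class returned is cancellative by the argument of the proof, so the brute-force check serves only as a consistency test of the sketch itself.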

There are $\Theta(r^2)$ non-isomorphic hypergraphs consisting of three edges $\{A, B, C\}$ with $A \cup B = A \cup C$.
The Turán number of the class of r-uniform hypergraphs $\mathbb{H} := \{H_1, H_2, \ldots\}$ is denoted by
$\mathrm{ex}(n, \mathbb{H})$. It is the size of the largest r-graph on n vertices avoiding every $H \in \mathbb{H}$. The sequence
$\mathrm{ex}(n, \mathbb{H}) \binom{n}{r}^{-1}$ is monotone decreasing, its limit is denoted by $\pi(\mathbb{H})$. When we consider the
determination of $c_r(n)$ as a Turán type problem, then there is a score of forbidden hypergraphs. Take only
one of them, namely the one defined by three sets on $2r - 1$ elements, $[r] := \{1, 2, \ldots, r\}$, $[r - 1] \cup \{r + 1\}$
and $\{r, r + 1, \ldots, 2r - 1\}$. It was proved in [21] that

Concerning another case, for an even $r$ when $T_r$ is a blown up triangle, its three edges are $X \cup Y$, $Y \cup Z$,
and $Z \cup X$ where $|X| = |Y| = |Z| = r/2$, Frankl [19] and Sidorenko [41, 42] showed independently
that $\pi(T_r) = 1/2$. For more on this see [30].



12 Conclusion, problems 

One of our main results is to give a better upper bound for the size of 2-cancellative codes. We 
conjecture that the upper bounds of Theorems 2.1 and 3.2 are much closer to the truth than the
simple probabilistic lower bounds we have. This is probably true for the uniform case, see pop . too. 

Conjecture 12.1 $n^{k+1-o(1)} \le c_{2k+1}(n, 2) = o(n^{k+1})$ as $n \to \infty$ and $k$ is fixed.

Call a code $\mathcal{F}$ t*-cancellative if

\[ A_1 \cup \cdots \cup A_t \cup B = A_1 \cup \cdots \cup A_t \cup C \implies B = C \text{ or } \{B, C\} \subset \{A_1, \ldots, A_t\} \]

for every $t + 2$ member sequence from $\mathcal{F}$, and let $c^*(n, t)$ be the maximum size of such a code $\mathcal{F} \subset 2^{[n]}$.
Obviously $C(n, t) \le c^*(n, t) \le C(n, t + 1) \le c(n, t)$. One wonders if equality holds in some of these,
and what other relations these functions can have.
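
To compare the two notions on small examples, one can test them by brute force. The Python sketch below is only illustrative and rests on our reading of the definitions: t-cancellative is checked over $t + 2$ distinct members, while for t*-cancellative the members $A_1, \ldots, A_t$ may repeat and may coincide with $B$ or $C$. The function names are hypothetical.

```python
from itertools import combinations, combinations_with_replacement


def is_t_cancellative(family, t):
    """No t+2 distinct members A_1..A_t, B, C with equal unions."""
    sets = list({frozenset(s) for s in family})
    for chosen in combinations(sets, t):
        base = frozenset().union(*chosen)
        unions = [base | s for s in sets if s not in chosen]
        if len(unions) != len(set(unions)):
            return False
    return True


def is_t_star_cancellative(family, t):
    """Equal unions force B == C or {B, C} inside {A_1..A_t} (repeats allowed)."""
    sets = list({frozenset(s) for s in family})
    for chosen in combinations_with_replacement(sets, t):
        base = frozenset().union(*chosen)
        for b, c in combinations(sets, 2):  # b != c by construction
            if base | b == base | c and not (b in chosen and c in chosen):
                return False
    return True


if __name__ == "__main__":
    family = [{1, 2}, {1}, {3}]
    print(is_t_cancellative(family, 1), is_t_star_cancellative(family, 1))
```

In the example, the family is 1-cancellative, but it is not 1*-cancellative under the reading above, because taking $A_1 = B = \{1, 2\}$ and $C = \{1\}$ gives equal unions.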

Using the Erdős, Frankl, and Rödl [15] estimate, see ([9]), one has

\[ n^{2-o(1)} < f_r(n, 3(r - 2) + 3, 3) \le c_r(n, r - 1). \]

The general upper bound ([5]) for Cr{n,t) here only gives O(n^), but in this case, leaving out those 
r-sets having an own pair, one can easily prove 

Cr{n, r — 1) < 

For more problems of this type, see, e.g., [21].

In Section 10.3 the 2k-partite hypergraph (with partite sets $V_1, \ldots, V_{2k}$) has an interesting
property. For every three members $A, B, C$ there exists a class $V_i$ such that $A \cap V_i$, $B \cap V_i$ and $C \cap V_i$
are distinct. It is natural to ask what other small substructures can be avoided this way.
The proof of Theorem 3.1 concerning $c(n, t)$ presented in Section 7 actually gives a slightly better
upper bound. With a little more calculation one can obtain an explicit bound $K_t$ for $t \ge 3$ such that

$\limsup\, (c(n, t))^{1/n} \le K_t <$

Many problems remain open. 